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1 CGATGTACGG GCCAGATATA CGCGTTGACA TTGATTATTG ACTAGTTATT 
GCTACATGCC CGGTCTATAT GCGCAACTGT AACTAATAAC TGATCAATAA 
51 AATAGTAATC AATTACGGGG TCATTAGTTC ATAGCCCATA TATGGAGTTC 
TTATCATTAG TTAATGCCCC AGTAATCAAG TATCGGGTAT ATACCTCAAG 
101 CGCGTTACAT AACTTACGGT AAATGGCCCG CCTGGCTGAC CGCCCAACGA 
GCGCAATGTA TTGAATGCCA TTTACCGGGC GGACCGACTG GCGGGTTGCT 
151 CCCCCGCCCA TTGACGTCAA TAATGACGTA TGTTCCCATA GTAACGCCAA 
GGGGGCGGGT AACTGCAGTT ATTACTGCAT ACAAGGGTAT CATTGCGGTT 

2 01 TAGGGACTTT CCATTGACGT CAATGGGTGG ACTATTTACG GTAAACTGCC 

ATCCCTGAAA GGTAACTGCA GTTACCCACC TGATAAATGC CATTTGACGG 
251 CACTTGGCAG TACATCAAGT GTATCATATG CCAAGTACGC CCCCTATTGA 
GTGAACCGTC ATGTAGTTCA CATAGTATAC GGTTCATGCG GGGGATAACT 

3 01 CGTCAATGAC GGTAAATGGC CCGCCTGGCA TTATGCCCAG TACATGACCT 

GCAGTTACTG CCATTTACCG GGCGGACCGT AATACGGGTC ATGTACTGGA 

3 51 TATGGGACTT TCCTACTTGG CAGTACATCT ACGTATTAGT CATCGCTATT 

ATACCCTGAA AGGATGAACC GTCATGTAGA TGCATAATCA GTAGCGATAA 

4 01 ACCATGGTGA TGCGGTTTTG GCAGTACATC AATGGGCGTG GATAGCGGTT 

TGGTACCACT ACGC CAAAAC CGTCATGTAG TTACCCGCAC CTATCGCCAA 
451 TGACTCACGG GGATTTCCAA GTCTCCACCC CATTGACGTC AATGGGAGTT 

ACTGAGTGCC CCTAAAGGTT CAGAGGTGGG GTAACTGCAG TTACCCTCAA 
501 TGTTTTGGCA CCAAAATCAA CGGGACTTTC CAAAATGTCG TAACAACTCC 

AC AAAAC CGT GGTTTTAGTT GCCCTGAAAG GTTTTACAGC ATTGTTGAGG 
551 GCCCCATTGA CGCAAATGGG CGGTAGGCGT GTACGGTGGG AGGTCTATAT 

CGGGGTAACT GCGTTTACCC GCCATCCGCA CATGCCACCC TCCAGATATA 
6 01 AAGCAGAGCT CTCTGGCTAA CTAGAGAACC CACTGCTTAC TGGCTTATCG 

TTCGTCTCGA GAGACCGATT GATCTCTTGG GTGACGAATG AC CGAATAGC 

Chi220 Leader 



Kpnl 



M D W • 

651 AAATTAATAC GACTCACTAT AGGGAGACCC AAGCTTGGTA CCATGGACTG 
TTTAATTATG CTGAGTGATA TCCCTCTGGG TTCGAACCAT GGTACCTGAC 

Chi22 0 Leader 



BamHI 



•TWR ILFL VAA ATG A H S E • 
701 GACCTGGAGG ATCCTCTTCT TGGTGGCAGC AGCAACAGGT GCCCACTCCG 

CTGGACCTCC TAGGAGAAGA ACCACCGTCG TCGTTGTCCA CGGGTGAGGC 

• VQL VES GGGL VQP GGS 
751 AAGTACAACT GGTGGAGTCT GGAGGAGGTT TGGTGCAACC TGGGGGTTCT 

TTCATGTTGA CCACCTCAGA CCTCCTCCAA ACCACGTTGG ACCCCCAAGA 

CDR1 



LRLS CAA SGF TFSD Y W M * 

801 CTGCGACTCT CTTGTGCAGC CTCGGGATTC ACTTTCAGTG ACTACTGGAT 

GACGCTGAGA GAACACGTCG GAGCCCTAAG TGAAAGTCAC TGATGAC CTA 

CDR1 CDR2 

f^f «-K> *S* >V 

•SWV RQAP GKG LEW V A D I • 
851 GAGCTGGGTT CGTCAGGCGC CTGGAAAGGG CCTGGAGTGG GTTGCAGATA 
CTCGACCCAA GCAGTCCGCG GACCTTTCCC GGACCTCACC CAACGTCTAT 
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CDR2 



KND GSY TNYA PSL TNR 
901 TTAAAAATGA TGGCAGTTAC ACAAACTATG CACCATCCCT AACGAATCGA 
AATTTTTACT ACCGTCAATG TGTTTGATAC GTGGTAGGGA TTGCTTAGCT 

PstI 



FTIS RDN AKN SLYL Q M N • 
951 TTCACAATCT CCAGAGACAA TGCCAAGAAC TCCCTGTACC TGCAGATGAA 
AAGTGTTAGA GGTCTCTGTT ACGGTTCTTG AGGGACATGG ACGTCTACTT 

CDR3 



• SLR AEDT AVY YCA R E L T • 
1001 CTCTCTGAGA GCTGAGGACA CAGCCGTTTA TTACTGTGCT AGAGAACTAA 
GAGAGACTCT CGACTCCTGT GTCGGCAAAT AATGACACGA TCTCTTGATT 
CDR3 



Nhel 



GTW GQG TMVT VSS AST 
1051 CTGGGACTTG GGGC CAAGGA ACCATGGTCA CAGTCTCCTC AGCTAGCACC 
GACCCTGAAC CCCGGTTCCT TGGT AC CAGT GTCAGAGGAG TCGATCGTGG 
KGPS VFP LAP GSRS T S E ♦ 
1101 AAGGGCCCAT CCGTCTTCCC CCTGGCGCCC TGCTCCAGGA GCACCTCCGA 
TTCCCGGGTA GGCAGAAGGG GGACCGCGGG ACGAGGTCCT CGTGGAGGCT 

Age I 



• S T A ALGC LVK DYF P E P V • 
1151 GAGCACAGCC GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG 

CTCGTGTCGG CGGGACCCGA CGGACCAGTT CCTGATGAAG GGGCTTGGCC 
Age I 

• TVS WNS GALT SGV HTF 
12 01 TGACGGTGTC GTGGAACTCA GGCGCCCTGA CCAGCGGCGT GCACACCTTC 

ACTGCCACAG CACCTTGAGT CCGCGGGACT GGTCGCCGCA CGTGTGGAAG 
PAVL QSS GLY SLSS V V T - 

12 51 CCGGCTGTCC TACAGTCCTC AGGACTCTAC TCCCTCAGCA GCGTGGTGAC 

GGCCGACAGG ATGTCAGGAG TCCTGAGATG AGGGAGTCGT CGCACCACTG 
•VPS SSLG TKT YTC N V D H * 

13 01 CGTGCCCTCC AGCAGCTTGG GCACGAAGAC CTACACCTGC AACGTAGATC 

GCACGGGAGG TCGTCGAACC CGTGCTTCTG GATGTGGACG TTGCATCTAG 
KPS NTK VDKR VES KYG 
13 51 ACAAGCCCAG CAACACCAAG GTGGACAAGA GAGTTGAGTC CAAATATGGT 
TGTTCGGGTC GTTGTGGTTC CACCTGTTCT CTCAACTCAG GTTTATACCA 
PPCP PCP APE FLGG P S V ♦ 
1401 CCACCTTGCC CACCTTGCCC AGCACCTGAG TTCCTGGGGG GACCATCAGT 
GGTGGAACGG GTGGAACGGG TCGTGGACTC AAGGACCCCC CTGGTAGTCA 

• F L F PPKP KDT LMI SRTP- 
1451 CTTCCTGTTC CCCCCAAAAC CCAAGGACAC TCTCATGATC TCCCGGACCC 

GAAGGACAAG GGGGGTTTTG GGTTCCTGTG AGAGTACTAG AGGGCCTGGG 
EVT CVV VDVS QED PEV 
1501 CTGAGGTCAC GTGCGTGGTG GTGGACGTGA GCCAGGAAGA CCCCGAGGTC 
GACTCCAGTG CACGCACCAC CACCTGCACT CGGTCCTTCT GGGGCTCCAG 
QFNW YVD GVE VHNA KTK- 
1551 CAGTTCAACT GGTACG TGGA TGGCGTGGAG GTGCATAATG CCAAGACAAA 
GTCAAGTTGA CCATGCACCT ACCGCACCTC CACGTATTAC GGTTCTGTTT 
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SacII 



• PRE EQFN STY R V V S V L T • 
1601 GCCGCGGGAG GAGCAGTTCA ACAGCACGTA CCGTGTGGTC AGCGTCCTCA 

CGGCGCCCTC CTCGTCAAGT TGTCGTGCAT GGCACACCAG TCGCAGGAGT 

• VLH QDW LNGK EYK CKV 
1651 CCGTCCTGCA CCAGGACTGG CTGAACGGCA AGGAGTACAA GTGCAAGGTC 

GGCAGGACGT GGTCCTGACC GACTTGCCGT TCCTCATGTT CACGTTCCAG 
SNKG LPS SIE KTIS K A K * 
17 01 TCCAACAAAG GCCTCCCGTC CTCCATCGAG AAAACCATCT CCAAAGCCAA 
AGGTTGTTTC CGGAGGGCAG GAGGTAGCTC TTTTGGTAGA GGTTTCGGTT 
•GQP REPQ VYT LPP S Q E E • 
1751 AGGGCAGCCC CGAGAGCCAC AGGTGTACAC CCTGCCCCCA TCCCAGGAGG 
TCCCGTCGGG GCTCTCGGTG TCCACATGTG GGACGGGGGT AGGGTCCTCC 

• MTK NQV SLTC LVK GFY 
1801 AGATGACCAA GAACCAGGTC AGCCTGACCT GCCTGGTCAA AGGCTTCTAC 

TCTACTGGTT CTTGGTCCAG TCGGACTGGA CGGACCAGTT TCCGAAGATG 
PSDI AVE WES NGQP E N N ♦ 
1851 CCCAGCGACA TCGCCGTGGA GTGGGAGAGC AATGGGCAGC CGGAGAACAA 
GGGTCGCTGT AGCGGCACCT CACCCTCTCG TTACCCGTCG GCCTCTTGTT 
•YKT TPPV LDS DGS FFLY- 
1901 CTACAAGACC ACGCCTCCCG TGCTGGACTC CGACGGCTCC TTCTTCCTCT 
GATGTTCTGG TGCGGAGGGC ACGACCTGAG GCTGCCGAGG AAGAAGGAGA 
SRL TVD KSRW QEG NVF 
1951 ACAGCAGGCT AACCGTGGAC AAGAGCAGGT GGCAGGAGGG GAATGTCTTC 
TGTCGTCCGA TTGGCACCTG TTCTCGTCCA CCGTCCTCCC CTTACAGAAG 
SCSV MHE ALH NHYT Q K S * 
2 001 TCATGCTCCG TGATGCATGA GGCTCTGCAC AACCACTACA CACAGAAGAG 
AGTACGAGGC ACTACGTACT CCGAGACGTG TTGGTGATGT GTGTCTTCTC 

Xbal 



• L S L S L G K 
2051 CCTCTCCCTG TCTCTGGGTA AATGATCTAG AGGGCCCTAT TCTATAGTGT 

GGAGAGGGAC AGAGACCCAT TTACTAGATC TCCCGGGATA AGATATCACA 
2101 CACCTAAATG CTAGAGCTCG CTGATCAGCC TCGACTGTGC CTTCTAGTTG 

GTGGATTTAC GATCTCGAGC GACTAGTCGG AGCTGACACG GAAGATCAAC 
2151 CCAGCCATCT GTTGTTTGCC CCTCCCCCGT GCCTTCCTTG ACCCTGGAAG 

GGTCGGTAGA CAACAAACGG GGAGGGGGCA CGGAAGGAAC TGGGACCTTC 
22 01 GTGCCACTCC CACTGTCCTT TCCTAATAAA ATGAGGAAAT TGCATCGCAT 

CACGGTGAGG GTGACAGGAA AGGATTATTT TACTCCTTTA ACGTAGCGTA 
2251 TGTCTGAGTA GGTGTCATTC TATTCTGGGG GGTGGGGTGG GGCAGGACAG 

ACAGACTCAT CCACAGTAAG ATAAGACCCC CCACCCCACC CCGTCCTGTC 
2301 CAAGGGGGAG GATTGGGAAG ACAATAGCAG GCATGCTGGG GATGCGGTGG 

GTTCCCCCTC CTAACCCTTC TGTTATCGTC CGTACGACCC CTACGCCACC 
2351 GCTCTATGGC TTCTGAGGCG GAAAGAACCA GCTGGGGCTC TAGGGGGTAT 

CGAGATACCG AAGACTCCGC CTTTCTTGGT CGACCCCGAG ATCCCCCATA 
24 01 CCCCACGCGC CCTGTAGCGG CGCATTAAGC GCGGCGGGTG TGGTGGTTAC 

GGGGTGCGCG GGACATCGCC GCGTAATTCG CGCCGCCCAC ACCACCAATG 
2451 GCGCAGCGTG ACCGCTACAC TTGCCAGCGC CCTAGCGCCC GCTCCTTTCG 

CGCGTCGCAC TGGCGATGTG AACGGTCGCG GGATCGCGGG CGAGGAAAGC 
2501 CTTTCTTCCC TTCCTTTCTC GCCACGTTCG CCGGGCCTCT CAAAAAAGGG 

GAAAGAAGGG AAGGAAAGAG CGGTGCAAGC GGCCCGGAGA GTTTTTTCCC 
2551 AAAAAAAGCA TGCATCTCAA TTAGTCAGCA ACCATAGTCC CGCCCCTAAC 

TTTTTTTCGT ACGTAGAGTT AATCAGTCGT TGGTATCAGG GCGGGGATTG 
2601 TCCGCCCATC CCGCCCCTAA CTCCGCCCAG TTCCGCCCAT TCTCCGCCCC 

AGGCGGGTAG GGCGGGGATT GAGGCGGGTC AAGGCGGGTA AGAGGCGGGG 
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2651 ATGGCTGACT AATTTTTTTT 

TACCGACTGA TTAAAAAAAA 
2 7 01 TCTGAGCTAT TCCAGAAGTA 

AGACTCGATA AGGTCTTCAT 
2 751 TTGCAAAAAG CTTGGACAGC 

AACGTTTTTC GAACCTGTCG 
2 801 GGCAATCCTA GCGTGAAGGC 

CCGTTAGGAT CGCACTTCCG 
2 851 GTTCGACCAT TGAACTGCAT 

CAAGCTGGTA ACTTGACGTA 
2 901 CAAGAACGGA GACCTACCCT 

GTTCTTGCCT CTGGATGGGA 

2 951 TCCAAAGAAT GACCACAACC 

AGGTTTCTTA CTGGTGTTGG 
3001 ATTATGGGTA GGAAAACCTG 
TAATACCCAT CCTTTTGGAC 

3 051 AAAGGACAGA ATTAATATAG 

TTTCCTGTCT TAATTATATC 
3101 GAGGAGCTCA TTTTCTTGCC 

CTCCTCGAGT AAAAGAACGG 
3151 GAACAACCGG AATTGGCAAG 

CTTGTTGGCC TTAACCGTTC 
3 2 01 CAGTTCTGTT TACCAGGAAG 

GTCAAGACAA ATGGTCCTTC 

32 51 TTGTGACAAG GATCATGCAG 

AACACTGTTC CTAGTACGTC 

33 01 ATTGATTTGG GGAAATATAA 

TAACTAAACC CCTTTATATT 
3351 TGAGGTC CAG GAGGAAAAAG 

ACTCCAGGTC CTCCTTTTTC 
3401 AGAAAGACTA ACAGGAAGAT 

TCTTTCTGAT TGTCCTTCTA 
3451 CTATGCATTT TTATAAGACC 

GATACGTAAA AATATTCTGG 
3 501 GTGAAGGAAC CTTACTTCTG 

CACTTCCTTG GAATGAAGAC 
3 551 AGAGATTTAA AGCTCTAAGG 

TCTCTAAATT TCGAGATTCC 
3 601 TTAAACTACT GATTCTAATT 

AATTTGATGA CTAAGATTAA 
3 651 ACTGATGAAT GGGAGCAGTG 

TGACTACTTA CCCTCGTCAC 
3 701 TTGCTCAGAA GAAATGC CAT 

AACGAGTCTT CTTTACGGTA 
3 751 AACATTCTAC TCCTCCAAAA 

TTGTAAGATG AGGAGGTTTT 
3 801 TTTCCTTCAG AATTGCTAAG 

AAAGGAAGTC TTAACGATTC 
3 851 AACTCTTGCT TGCTTTGCTA 

TTGAGAACGA ACGAAACGAT 
3 901 TATACAAGAA AATTATGGAA 

ATATGTTCTT TTAATACCTT 

3 951 AACAGTTATA ATCATAACAT 

TTGTCAATAT TAGTATTGTA 

4 001 AGTGTCTGCT ATTAATAACT 

TCACAGACGA TAATTATTGA 



ATTTATGCAG AGGCCGAGGC CGCCTCGGCC 
TAAATACGTC TCCGGCTCCG GCGGAGCCGG 
GTGAGGAGGC TTTTTTGGAG GCCTAGGCTT 
CACTCCTCCG AAAAAACCTC CGGATCCGAA 
TCAGGGCTGC GATTTCGCGC CAAACTTGAC 
AGTCCCGACG CTAAAGCGCG GTTTGAACTG 
TGGTAGGATT TTATCCCCGC TGC CATCATG 
ACCATCCTAA AATAGGGGCG ACGGTAGTAC 
CGTCGCCGTG TCCCAAAATA TGGGGATTGG 
GCAGCGG CAC AGGGTTTTAT ACCCCTAACC 
GGCCTCCGCT CAGGAACGAG TTCAAGTACT 
CCGGAGGCGA GTCCTTGCTC AAGTTCATGA 
TCTTCAGTGG AAGGTAAACA GAATCTGGTG 
AGAAGTCACC TTCCATTTGT CTTAGACCAC 
GTTCTCCATT CCTGAGAAGA ATCGACCTTT 
CAAGAGGTAA GGACTCTTCT TAGCTGGAAA 
TTCTCAGTAG AGAACTCAAA GAACCACCAC 
AAGAGTCATC TCTTGAGTTT CTTGGTGGTG 
AAAAGTTTGG ATGATGCCTT AAGACTTATT 
TTTTCAAACC TACTACGGAA TTCTGAATAA 
TAAAGTAGAC ATGGTTTGGA TAGT CGGAGG 
ATTTCATCTG TACCAAACCT ATCAGCCTCC 
CCATGAATCA ACCAGGCCAC CTTAGACTCT 
GGTACTTAGT TGGTCCGGTG GAATCTGAGA 
GAATTTGAAA GTGACACGTT TTTCCCAGAA 
CTTAAACTTT CACTGTGCAA AAAGGGTCTT 
ACTTCTCCCA GAATACCCAG GCGTCCTCTC 
TGAAGAGGGT CTTATGGGTC CGCAGGAGAG 
GCATCAAGTA TAAGTTTGAA GTCTACGAGA 
CGTAGTTCAT ATTCAAACTT CAGATGCTCT 
GCTTTCAAGT TCTCTGCTCC CCTCCTAAAG 
CGAAAGTTCA AGAGACGAGG GGAGGATTTC 
ATGGGACTTT TGCTGGCTTT AGATCTCTTT 
TACCCTGAAA ACGACCGAAA TCTAGAGAAA 
TGGTGTGACA TAATTGGACA AACTACCTAC 
ACCACACTGT ATTAACCTGT TTGATGGATG 
TAAATATAAA ATTTTTAAGT GTATAATGTG 
ATTTATATTT TAAAAATTCA CATATTACAC 
GTTTGTGTAT TTTAGATTCC AACCTATGGA 
CAAACACATA AAATCTAAGG TTGGATACCT 
GTGGAATGCC TTTAATGAGG AAAACCTGTT 
CACCTTACGG AAATTACTCC TTTTGGACAA 
CTAGTGATGA TGAGGCTACT GCTGACTCTC 
GATCACTACT ACTCCGATGA CGACTGAGAG 
AAGAAGAGAA AGGTAGAAGA CCCCAAGGAC 
TTCTTCTCTT TCCATCTTCT GGGGTTCCTG 
TTTTTTGAGT CATGCTGTGT TTAGTAATAG 
AAAAAACTCA GTACGACACA AATCATTATC 
TTT AC AC CAC AAAGGAAAAA GCTGCACTGC 
AAATGTGGTG TTTCCTTTTT CGACGTGACG 
AAATATTCTG TAACCTTTAT AAGTAGGCAT 
TTTATAAGAC ATTGGAAATA TTCATCCGTA 
ACTGTTTTTT CTTACTCCAC ACAGGCATAG 
TGACAAAAAA GAATGAGGTG TGTCCGTATC 
ATGCTCAAAA ATTGTGTACC TTTAGCTTTT 
TACGAGTTTT TAACACATGG AAATCGAAAA 
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4 051 TAATTTGTAA AGGGGTTAAT AAGGAATATT TGATGTATAG TGCCTTGACT 
ATTAAACATT TCCCCAATTA TTCCTTATAA ACTACATATC ACGGAACTGA 

4101 AGAGATCATA ATCAGCCATA CCACATTTGT AGAGGTTTTA CTTGCTTTAA 
TCTCTAGTAT TAGTCGGTAT GGTGTAAACA TCTCCAAAAT GAACGAAATT 

4151 AAAACCTCCC ACACCTCCCC CTGAACCTGA AACATAAAAT GAATGCAATT 
TTTTGGAGGG TGTGGAGGGG GACTTGGACT TTGTATTTTA CTTACGTTAA 

42 01 GTTGTTGTTA ACTTGTTTAT TGCAGCTTAT AATGGTTACA AATAAAGCAA 

CAACAACAAT TGAACAAATA ACGTCGAATA TT AC CAATGT TTATTTCGTT 
4251 TAGCATCACA AATTTCACAA ATAAAGCATT TTTTTCACTG CATTCTAGTT 
AT CGT AGTGT TTAAAGTGTT TATTTCGTAA AAAAAGTGAC GTAAGATCAA 

43 01 GTGGTTTGTC CAAACTCATC AATGTATCTT ATCATGTCTG GATCGGCTGG 

CACCAAACAG GTTTGAGTAG TTACATAGAA TAGTACAGAC CTAGCCGACC 
4 3 51 ATGATCCTCC AGCGCGGGGA TCTCATGCTG GAGTTCTTCG CCCACCCCAA 
TACTAGGAGG TCGCGCCCCT AGAGTACGAC CTCAAGAAGC GGGTGGGGTT 

44 01 CTTGTTTATT GCAGCTTATA ATGGTTACAA ATAAAGCAAT AGCATCACAA 

GAACAAATAA CGTCGAATAT TACCAATGTT TATTTCGTTA TCGTAGTGTT 
44 51 ATTTCACAAA TAAAGCATTT TTTTCACTGC ATTCTAGTTG TGGTTTGTCC 

TAAAGTGTTT ATTTCGTAAA AAAAGTGACG TAAGATCAAC ACCAAACAGG 
4501 AAACTCATCA ATGTATCTTA TCATGTCTGT ATACCGTCGA CCTCTAGCTA 

TTTGAGTAGT TACATAGAAT AGTACAGACA TATGGCAGCT GGAGATCGAT 
4551 GAGCTTGGCG TAATCATGGT CATAGCTGTT TCCTGTGTGA AATTGTTATC 

CTCGAACCGC ATTAGTACCA GTATCGACAA AGGACACACT TTAACAATAG 
4601 CGCTCACAAT TCCACACAAC ATACGAGCCG GAAGCATAAA GTGTAAAGCC 

GCGAGTGTTA AGGTGTGTTG TATGCTCGGC CTTCGTATTT CACATTTCGG 
4 651 TGGGGTGCCT AATGAGTGAG CTAACTCACA TTAATTGCGT TGCGCTCACT 

ACCCCACGGA TTACTCACTC GATTGAGTGT AATTAACGCA ACGCGAGTGA 
4701 GCCCGCTTTC CAGTCGGGAA ACCTGTCGTG CCAGCTGCAT TAATGAATCG 

CGGGCGAAAG GTCAGCCCTT TGGACAGCAC GGTCGACGTA ATTACTTAGC 
4 751 GCCAACGCGC GGGGAGAGGC GGTTTGCGTA TTGGGCGCTC TTCCGCTTCC 

CGGTTGCGCG CCCCTCTCCG CCAAACGCAT AACCCGCGAG AAGGCGAAGG 
4 8 01 TCGCTCACTG ACTCGCTGCG CTCGGTCGTT CGGCTGCGGC GAGCGGTATC 

AGCGAG TGAC TGAGCGACGC GAGCCAGCAA GCCGACGCCG CTCGCCATAG 
4 851 AGCTCACTCA AAGGCGGTAA TACGGTTATC CACAGAATCA GGGGATAACG 

TCGAGTGAGT TTCCGCCATT ATGCCAATAG GTGTCTTAGT CCCCTATTGC 
4 901 CAGGAAAGAA CATGTGAGCA AAAGGCCAGC AAAAGGCCAG GAACCGTAAA 

GTCCTTTCTT GTACACTCGT TTTCCGGTCG TTTTCCGGTC CTTGGCATTT 

4 951 AAGGCCGCGT TGCTGGCGTT TTTCCATAGG CTCCGCCCCC CTGACGAGCA 

TTCCGGCGCA ACGACCGCAA AAAGGTATCC GAGGCGGGGG GACTGCTCGT 

5 001 TCACAAAAAT CGACGCTCAA GTCAGAGGTG GCGAAACCCG ACAGGACTAT 

AGTGTTTTTA GCTGCGAGTT CAGTCTCCAC CGCTTTGGGC TGTCCTGATA 
5 051 AAAGATAC CA GGCGTTTCCC CCTGGAAGCT CCCTCGTGCG CTCTCCTGTT 

TTTCTATGGT CCGCAAAGGG GGACCTTCGA GGGAGCACGC GAGAGGACAA 
5101 CCGACCCTGC CGCTTACCGG ATACCTGTCC GCCTTTCTCC CTTCGGGAAG 

GGCTGGGACG GCGAATGGCC TATGGACAGG CGGAAAGAGG GAAGCCCTTC 
5151 CGTGGCGCTT TCTCAATGCT CACGCTGTAG GTATCTCAGT TCGGTGTAGG 

GCACCGCGAA AGAGTTACGA GTGCGACATC CATAGAGTCA AGCCACATCC 

52 01 TCGTTCGCTC CAAGCTGGGC TGTGTGCACG AACCCCCCGT TCAGCCCGAC 

AGCAAGCGAG GTTCGACCCG ACACACGTGC TTGGGGGGCA AGT CGGGCTG 
5251 CGCTGCGCCT TATCCGGTAA CTATCGTCTT GAGTCCAACC CGG TAAGACA 
GCGACGCGGA ATAGGC CATT GATAGCAGAA CTCAGGTTGG GCCATTCTGT 

53 01 CGACTTATCG CCACTGGCAG CAGCCACTGG TAACAGGATT AGCAGAGCGA 

GCTGAATAGC GGTGACCGTC GTCGGTGACC ATTGTCCTAA TCGTCTCGCT 
5 3 51 GGTATGTAGG CGGTGCTACA GAGTTCTTGA AGTGGTGGCC TAACTACGGC 

CCATACATCC GCCACGATGT CTCAAGAACT TCACCACCGG ATTGATGCCG 
5401 TACAC TAGAA GGACAGTATT TGGTATCTGC GCTCTGCTGA AGC CAGTTAC 

ATGTGATCTT CCTGTCATAA ACCATAGACG CGAGACGACT TCGGTCAATG 
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5451 C TTCGGAAAA AGAGTTGGTA GCTCTTGATC CGGCAAACAA ACCACCGCTG 

GAAGCCTTTT TCTCAACCAT CGAGAACTAG GCCGTTTGTT TGGTGGCGAC 
5501 GTAGCGGTGG TTTTTTTGTT TGCAAGCAGC AGATTACGCG CAGAAAAAAA 

CATCGCCACC AAAAAAACAA ACGTTCGTCG TCTAATGCGC GTCTTTTTTT 
5551 GGATCTCAAG AAGATCCTTT GATCTTTTCT ACGGGGTCTG ACGCTCAGTG 

CCTAGAGTTC TTCTAGGAAA CTAGAAAAGA TGCCCCAGAC TGCGAGTCAC 
5601 GAACGAAAAC TCACGTTAAG GGATTTTGGT CATGAGATTA T C AAAAAGG A 

CTTGCTTTTG AGTGCAATTC CCTAAAACCA GTACTCTAAT AGTTTTTCCT 
5651 TCTTCACCTA GATCCTTTTA AATTAAAAAT GAAGTTTTAA ATCAATCTAA 

AGAAGTGGAT CTAGGAAAAT TTAATTTTTA CTTCAAAATT TAGTTAGATT 
5701 AGTATATATG AGTAAACTTG GTCTGACAGT TACCAATGCT TAATCAGTGA 

TCATATATAC TCATTTGAAC CAGAC TGTCA ATGGTTACGA ATTAGTCACT 
5751 GGCACCTATC TCAGCGATCT GTCTATTTCG TTCATCCATA GTTGCCTGAC 

CCGTGGATAG AGTCGCTAGA CAGATAAAGC AAGTAGGTAT CAACGGACTG 
5 801 TCCCCGTCGT GTAGATAACT ACGATACGGG AGGGCTTACC ATCTGGCCCC 

AGGGGCAGCA CAT CT ATTGA TGCTATGCCC TCCCGAATGG TAGACCGGGG 
5851 AGTGCTGCAA TGATACCGCG AGACCCACGC TCACCGGCTC CAGATTTATC 

TCACGACGTT ACTATGGCGC TCTGGGTGCG AGTGGCCGAG GTCTAAATAG 
5 901 AGCAATAAAC CAGCCAGCCG GAAGGGCCGA GCGCAGAAGT GGTCCTGCAA 

TCGTTATTTG GTCGGTCGGC CTTCCCGGCT CGCGTCTTCA CCAGGACGTT 

5 951 CTTTATCCGC CTCCATCCAG TCTATTAATT GTTGC CGGGA AGCTAGAGTA 

GAAATAGGCG GAGGTAGGTC AGATAATTAA CAACGGCCCT TCGATCTCAT 
6001 AGTAGTTCGC CAGTTAATAG TTTGCGCAAC GTTGTTGCCA TTGCTACAGG 
TCATCAAGCG GTCAATTATC AAACGCGTTG CAACAACGGT AACGATGTCC 

6 051 CATCGTGGTG TCACGCTCGT CGTTTGGTAT GGCTTCATTC AGCTCCGGTT 

GTAGCACCAC AGTGCGAGCA GCAAACCATA CCGAAGTAAG TCGAGGCCAA 
6101 CCCAACGATC AAGGCGAGTT ACATGATCCC CCATGTTGTG CAAAAAAGCG 

GGGTTGCTAG TTCCGCTCAA TGTACTAGGG GGTACAACAC GTTTTTTCGC 
6151 GTTAGCTCCT TCGGTCCTCC GATCGTTGTC AGAAGTAAGT TGGCCGCAGT 

CAATCGAGGA AGCCAGGAGG CTAGCAACAG TCTTCATTCA ACCGGCGTCA 
62 01 GTTATCACTC ATGGTTATGG CAGCACTGCA TAATTCTCTT ACTGTCATGC 

CAATAGTGAG TACCAATACC GTCGTGACGT ATTAAGAGAA TGACAGTACG 

62 51 CATCCGTAAG ATGCTTTTCT GTGACTGGTG AGTACTCAAC CAAGTCATTC 

GTAGGCATTC TACGAAAAGA CACTGACCAC TCATGAGTTG GTTCAGTAAG 

63 01 TGAGAATAGT GTATGCGGCG ACCGAGTTGC TCTTGCCCGG CGTCAATACG 

ACTCTTATCA CATACGCCGC TGGCTCAACG AGAACGGGCC GCAGTTATGC 
63 51 GGATAATACC GCGCCACATA GCAGAACTTT AAAAGTGCTC ATCATTGGAA 

CCTATTATGG CGCGGTGTAT CGTCTTGAAA TTTTCACGAG TAGTAACCTT 
6401 AACGTTCTTC GGGGCGAAAA CTCTCAAGGA TCTTACCGCT GTTGAGATCC 

TTGCAAGAAG CCCCGCTTTT GAGAGTTCCT AGAATGGCGA CAACTCTAGG 
6451 AGTTCGATGT AACCCACTCG TGCACCCAAC TGATCTTCAG CATCTTTTAC 

TCAAGCTACA TTGGGTGAGC ACGTGGGTTG ACTAGAAGTC GTAGAAAATG 
6501 TTTCACCAGC GTTTCTGGGT GAGCAAAAAC AGGAAGGCAA AATGCCGCAA 

AAAGTGGTCG CAAAGACCCA CTCGTTTTTG TCCTTCCGTT TTACGGCGTT 
6551 AAAAGGGAAT AAGGGCGACA CGGAAATGTT GAATACTCAT ACTCTTCCTT 

TTTTCCCTTA TTCCCGCTGT GCCTTTACAA CTTATGAGTA TGAGAAGGAA 
6601 TTTCAATATT ATTGAAGCAT TTATCAGGGT TATTGTCTCA TGAGCGGATA 

AAAGTTATAA TAACTTCGTA AATAGTCCCA ATAACAGAGT ACTCGCCTAT 
6651 CATATTTGAA TGTATTTAGA AAAATAAACA AATAGGGGTT CCGCGCACAT 

GTATAAACTT ACATAAATCT TTTTATTTGT TTATCCCCAA GGCGCGTGTA 
6701 TTCCCCGAAA AGTGCCACCT GACGTCGACG GATCGGGAGA TCTGCTAGGT 

AAGGGGCTTT TCACGGTGGA CTGCAGCTGC CTAGCCCTCT AGACGATCCA 
AscI 



6751 GACCTGAGGC GCGCCGGCTT CGAATAGCCA GAGTAACCTT TTTTTTTAAT 
CTGGACTCCG CGCGGCCGAA GCTTATCGGT CTCATTGGAA AAAAAAATTA 
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6801 TTTATTTTAT TTTATTTTTG 

AAATAAAATA AAATAAAAAC 
6851 CCTATGGTCG ACTCTCAGTA 

GGATACCAGC TGAGAGTCAT 
6901 AGTATCTGCT CCCTGCTTGT 

TCATAGACGA GGGACGAACA 
6951 AAAATTTAAG CTACAACAAG 

TTTTAAATTC GATGTTGTTC 
7001 TCTGCTTAGG GTTAGGCGTT 

AGACGAATCC CAATCCGCAA 



AGATGGAGTT TGGCGCCGAT CTCCCGATCC 
TCTACCTCAA ACCGCGGCTA GAGGGCTAGG 
CAATCTGCTC TGATGCCGCA TAGTTAAGCC 
GTTAGACGAG ACTACGGCGT ATCAATTCGG 
GTGTTGGAGG TCGCTGAGTA GTGCGCGAGC 
CACAACCTCC AGCGACTCAT CACGCGCTCG 
GCAAGGCTTG ACCGACAATT GCATGAAGAA 
CGTTCCGAAC TGGCTGTTAA CGTACTTCTT 
TTGCGCTGCT TCG 
AACGCGACGA AGC 
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1 AATTACGGGG TCATTAGTTC ATAGCCCATA TATGGAGTTC CGCGTTACAT 

TTAATGCCCC AGTAATCAAG TATCGGGTAT ATACC TCAAG GCGCAATGTA 
51 AACTTACGGT AAATGGCCCG CCTGGCTGAC CGCCCAACGA CCCCCGCCCA 

TTGAATGCCA TTTACCGGGC GGACCGACTG GCGGGTTGCT GGGGGCGGGT 
101 TTGACGTCAA TAATGACGTA TGTTCCCATA GTAACGCCAA TAGGGACTTT 

AACTGCAGTT ATTACTGCAT ACAAGGGTAT CATTGCGGTT ATCCCTGAAA 
151 CCATTGACGT CAATGGGTGG ACTATTTACG GTAAACTGCC CACTTGGCAG 

GGTAACTGCA GTTACCCACC TGATAAATGC CATTTGACGG GTGAACCGTC 
2 01 TACATCAAGT GTATCATATG CCAAGTACGC CCCCTATTGA CGTCAATGAC 

ATGTAGTTCA CATAGTATAC GGTTCATGCG GGGGATAACT GCAGTTACTG 

2 51 GGTAAATGGC CCGCCTGGCA TTATGCCCAG TACATGACCT TATGGGACTT 

CCATTTACCG GGCGGACCGT AATACGGGTC ATGTACTGGA ATACC CTGAA 

3 01 TCCTACTTGG CAGTACATCT ACGTATTAGT CATCGCTATT ACCATGGTGA 

AGGATGAACC GTCATGTAGA TGCATAATCA GTAGCGATAA TGGTACCACT 
3 51 TGCGGTTTTG GCAGTACATC AATGGGCGTG GATAGCGGTT TGACTCACGG 

ACGCCAAAAC CGTCATGTAG TTACCCGCAC CTATCGCCAA ACTGAGTGCC 
401 GGATTTCCAA GTCTCCACCC CATTGACGTC AATGGGAGTT TGTTTTGGCA 

CCTAAAGGTT CAGAGGTGGG GTAACTGCAG TTACCCTCAA ACAAAACCGT 
451 CCAAAATCAA CGGGACTTTC CAAAATGTCG TAACAACTCC GCCCCATTGA 

GGTTTTAGTT GCCCTGAAAG GTTTTACAGC ATTGTTGAGG CGGGGTAACT 
501 CGCAAATGGG CGGTAGGCGT GTACGGTGGG AGGTCTATAT AAGCAGAGCT 

GCGTTTACCC GCCATCCGCA CATGCCACCC TCCAGATATA TTCGTCTCGA 
551 CTCTGGCTAA CTAGAGAAC C CACTGCTTAC TGGCTTATCG AAATTAATAC 

GAGACCGATT GATCTCTTGG GTGACGAATG AC CGAATAGC TTTAATTATG 

Kpnl 

«— 

M E A P A Q 
6 01 GACTCACTAT AGGGAGACCC AAGCTTGGTA CCATGGAAGC CCCAGCTCAG 
CTGAGTGATA TCCCTCTGGG TTCGAAC CAT GGTACCTTCG GGGTCGAGTC 
LLFL LLL WLP DTTG D I V • 
651 CTTCTCTTCC TCCTGCTACT CTGGCTCCCA GATACCACCG GAGACATTGT 
GAAGAGAAGG AGGACGATGA GACCGAGGGT CTATGGTGGC CTC TGTAAC A 
•MTQ SPDS LAV SLG E R A T • 
701 AATGAC C C AG TCTCCAGACT CCCTGGCTGT GTCACTAGGA GAGCGGGCCA 
TTACTGGGTC AGAGGTCTGA GGGAC CGAC A CAGTGATCCT CTCGCCCGGT 

CDR1 



INC KSS QSLL SSG NQK 
751 CTATAAACTG CAAGTCCAGT CAGAGTCTTT TATCCAGTGG AAACCAAAAG 
GATATTTGAC GTTCAGGTCA GTCTCAGAAA ATAGGTCACC TTTGGTTTTC 
CDR1 



NYLA WYQ QKP GQPP K L L • 

801 AACTATTTGG CCTGGTATCA GCAGAAACCA GGCCAGCCTC CTAAACTACT 

TTGATAAACC GGACCATAGT CGTCTTTGGT CCGGTCGGAG GATTTGATGA 

CDR2 



• I Y Y ASTR QSG VPD R F S G * 
851 GATCTACTAT GCATCCACTA GGCAATCAGG GGTCCCTGAT CGCTTCAGTG 
CTAGATGATA CGTAGGTGAT CCGTTAGTCC CCAGGGACTA GCGAAGTCAC 

PstI 



SGS GTD FTLT ISS LQA 
901 GCAGTGGATC TGGGACGGAC TTCACTCTGA CCATCAGCAG CCTGCAGGCT 
CGTCACCTAG ACCCTGCCTG AAGTGAGACT GGTAGTCGTC GGACGTCCGA 
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CDR3 

PstI EcoRV 

EDVA V Y Y CLQ YDRY PFT 
951 GAGGACGTGG CAGTCTATTA CTGCCTGCAG TATGACAGAT ATCCATTCAC 
CTCCTGCACC GTCAGATAAT GACGGACGTC ATACTGTCTA TAGGTAAGTG 
CDR3 

Xhol Xbal 

<~-* *W «-v »-^» 

♦ F G Q GTKL EIK R 
1001 GTTCGGCCAA GGGACGAAGT TGGAAATAAA ACGTAAGTCT CGAGTCTCTA 
CAAGCCGGTT CCCTGCTTCA ACCTTTATTT TGCATTCAGA GCTCAGAGAT 
Age I 



Xbal EcoRI 



1051 GATAAC CGGT CAATCGATTG GAATTCTAAA CTCTGAGGGG GTCGGATGAC 

CTATTGGCCA GTTAGCTAAC CTTAAGATTT GAGACTCCCC CAGCCTACTG 
1101 GTGGCCATTC TTTGCCTAAA GCATTGAGTT TACTGCAAGG TCAGAAAAGC 

CACCGGTAAG AAACGGATTT CGTAACTCAA ATGACGTTCC AGTCTTTTCG 
1151 ATGCAAAGCC CTCAGAATGG CTGCAAAGAG CTCCAACAAA ACAATTTAGA 

TACGTTTCGG GAGTCTTACC GACGTTTCTC GAGGTTGTTT TGTTAAATCT 
12 01 ACTTTATTAA GGAATAGGGG GAAGCTAGGA AGAAACTCAA AACATCAAGA 

TGAAATAATT CCTTATCCCC CTTCGATCCT TCTTTGAGTT TTGTAGTTCT 

12 51 TTTTAAATAC GCTTCTTGGT CTCCTTGCTA TAATTATCTG GGATAAGCAT 

AAAATTTATG CGAAGAACCA GAGGAACGAT ATTAATAGAC CCTATTCGTA 

13 01 GCTGTTTTCT GTCTGTCCCT AACATGCCCT GTGATTATCC GCAAACAACA 

CGACAAAAGA CAGACAGGGA TTGTACGGGA CACTAATAGG CGTTTGTTGT 
13 51 CACCCAAGGG CAGAACTTTG TTACTTAAAC ACCATCCTGT TTGCTTCTTT 

GTGGGTTCCC GTCTTGAAAC AATGAATTTG TGGTAGGACA AACGAAGAAA 
T VAA PSVF IFP PSD 
1401 CCTCAGGAAC TGTGGCTGCA CCATCTGTCT TCATCTTCCC GCCATCTGAT 

GGAGTCCTTG ACACCGACGT GGTAGACAGA AGTAGAAGGG CGGTAGACTA 
EQLK SGT ASV VCLL NNF 
1451 GAGCAGTTGA AATCTGGAAC TGCCTCTGTT GTGTGCCTGC TGAATAACTT 

CTCGTCAACT TTAGACCTTG ACGGAGACAA CACACGGACG ACTTATTGAA 

•YPR EAKV QWK VDN ALQS 
15 01 CTATCCCAGA GAGGCCAAAG TACAGTGGAA GGTGGATAAC GCCCTCCAAT 

GATAGGGTCT CTCCGGTTTC ATGTCACCTT CCACCTATTG CGGGAGGTTA 

• GNS QES VTEQ DSK DST 
1551 CGGGTAACTC CCAGGAGAGT GTCACAGAGC AGGACAGCAA GGACAGCACC 

GCCCATTGAG GGTCCTCTCA CAGTGTCTCG TCCTGTCGTT CCTGTCGTGG 
YSLS STL TLS KADY EKH 
1601 TACAGCCTCA GCAGCACCCT GACGCTGAGC AAAGCAGACT ACGAGAAACA 

ATGTCGGAGT CGTCGTGGGA CTGCGACTCG TTTCGTCTGA TGCTCTTTGT 

•KVY ACEV THQ GLS SPVT 
1651 CAAAGTCTAC GCCTGCGAAG TCACCCATCA GGGCCTGAGC TCGCCCGTCA 

GTTTCAGATG CGGACGCTTC AGTGGGTAGT CCCGGACTCG AGCGGGCAGT 

• KSF NRG EC 

17 01 CAAAGAGCTT CAACAGGGGA GAGTGTTAGA GGGAGAAGTG CCCCCACCTG 
GTTTCTCGAA GTTGTCCCCT CTCACAATCT CCCTCTTCAC GGGGGTGGAC 

1751 CTCCTCAGTT CCAGCCTGAC CCCCTCCCAT CCTTTGGCCT CTGACCCTTT 
GAGGAGTCAA GGTCGGACTG GGGGAGGGTA GGAAACCGGA GACTGGGAAA 

1801 TTCCACAGGG GACCTACCCC TATTGCGGTC CTCCAGCTCA TCTTTCACCT 
AAGGTGTCCC CTGGATGGGG ATAACGCCAG GAGGTCGAGT AGAAAGTGGA 
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1851 CACCCCCCTC CTCCTCCTTG 

GTGGGGGGAG GAGGAGGAAC 
1901 GAATAAATAA AGTGAATCTT 

CTTATTTATT TCACTTAGAA 
1951 AATAATTATT ATCTGTTGTT 

TTATTAATAA TAGACAACAA 
2 001 ACTAAATATG TAGTCATCCT 

TGATTTATAC ATCAGTAGGA 
2051 TTCATTCTAT TTTACCCTAT 

AAGTAAGATA AAATGGGATA 
2101 CCACAAGCCT TCTGTCCTCA 

GGTGTTCGGA AGACAGGAGT 
2151 GCTTCCTTGT TTTCCCCTCC 

CGAAGGAACA AAAGGGGAGG 
22 01 TGACAGGTCT TACAGTCATA 

ACTGTCCAGA ATGTCAGTAT 
2251 CCAAAGCAAA TTTTTCAAAA 

GGTTTCGTTT AAAAAGTTTT 
2301 TTGCAACATG ATATAAAATA 

AACGTTGTAC TATATTTTAT 
2351 ACAATAGGGA AATGTTTAAG 

TGTTATCCCT TTACAAATTC 
2401 TCATGCCTTA TTTACATTTT 

AGTACGGAAT AAATGTAAAA 
2451 AAGGGCCGTA TTGAGTACTT 

TTCCCGGCAT AACTCATGAA 
2501 GTGAGATTAA AAACATTCAT 

CACTCTAATT TTTGTAAGTA 



2 551 GAGACAAATA TATTCTATAA 

CTCTGTTTAT ATAAGATATT 
2 6 01 GTCCCCACCC ACCAAAAAAC 

CAGGGGTGGG TGGTTTTTTG 
2651 ACAAAAGCCA AAAATTGGAA 

TGTTTTCGGT TTTTAACCTT 
2 701 TATTAAACTG TGGTATGTTT 

ATAATTTGAC ACCATACAAA 
2 751 ACAAGCTACA ACTATACCTA 

TGTTCGATGT TGATATGGAT 
2 801 TTACATAAGA GAAACTCAAT 

AATGTATTCT CTTTGAGTTA 
2 8 51 ATATAAAGTT CAAAACCAGG 

TATATTTCAA GTTTTGGTCC 
2 901 TTACTCTTAG CTGGGGGTGG 

AATGAGAATC GACCCCCACC 

2 951 GGGCTTCTGG GGTCTTGGTA 

CCCGAAGACC CCAGAAC CAT 

3 001 TTATGATCTG TGCACTGTTC 

AATACTAGAC ACGTGACAAG 
3051 ACATAAAGAA CATCTTATAC 

TGTATTTCTT GTAGAATATG 
3101 TAGGTCAAGA CCACGCAGCT 

ATCCAGTTCT GGTGCGTCGA 
3151 TACCTGCCTA ATCCTGCCCT 

ATGGACGGAT TAGGACGGGA 



GCTTTAATTA TGCTAATGTT GGAGGAGAAT 
CGAAATTAAT ACGATTACAA CCTCCTCTTA 
TGCACCTGTG GTTTCTCTCT TTCCTCATTT 
ACGTGGACAC CAAAGAGAGA AAGGAGTAAA 
TTACCAACTA CTCAATTTCT CTTATAAGGG 
AATGGTTGAT GAGTTAAAGA GAATATTCCC 
AAGGCGCATA AC GATTTATA AAAATCATCC 
TTCCGCGTAT TGGTAAATAT TTTTAGTAGG 
CATCCTCTGC AAGACAGTCC TCCCTCAAAC 
GTAGGAGACG TTCTGTCAGG AGGGAGTTTG 
CAGTCCCCTG GGCCATGGTA GGAGAGACTT 
GTCAGGGGAC CCGGTACCAT CCTCTCTGAA 
TCAGCAAGCC CTCATAGTCC TTTTTAAGGG 
AGTCGTTCGG GAGTATCAGG AAAAATTCCC 
TATCCTTTGA TTCAATTCCC TGAGAATCAA 
ATAGGAAACT AAGTTAAGGG ACTCTTAGTT 
GAAGAAACCT GCTATAAAGA GAATCATTCA 
CTTCTTTGGA CGATATTTCT CTTAGTAAGT 
ACAACACAAT AAAAGCAATT AAATAAACAA 
TGTTGTGTTA TTTTCGTTAA TTTATTTGTT 
TTCATCATGG TACTTAGACT TAATGGAATG 
AAGTAGTACC ATGAATCTGA ATTACCTTAC 
TAAACAGGTA CTGAGGGACT CCTGTCTGCC 
ATTTGTCCAT GACTCCCTGA GGACAGACGG 
TCCACAACCT AATTTAATCC ACACTATACT 
AGGTGTTGGA TTAAATTAGG TGTGATATGA 
TAAAATGTTG CAAAGGTTCT ATAAAGCTGA 
ATTTTACAAC GTTTCCAAGA TATTTCGACT 

Xbal 

CTCAGCAATC CCACTTCTAG ATGACTGAGT 
GAGTCGTTAG GGTGAAGATC TACTGACTCA 
TATGCAAGAA TGTTCAAAGC AGCTTTATTT 
ATACGTTCTT ACAAGTTTCG TCGAAATAAA 
ATAGCCCGAT TGTCCAACAA TAGAATGAGT 
TATCGGGCTA ACAGGTTGTT ATCTTACTCA 
ATACATTAGA ATACCCAATG AGGAGAATTA 
TATGTAATCT TATGGGTTAC TCCTCTTAAT 
CTCACACAGA TGAATCTCAT AAAAATAATG 
GAGTGTGTCT ACTTAGAGTA TTTTTATTAC 
GCAAAAGATA TGTTCTGTAT GTTTTCATCC 
CGTTTTCTAT ACAAGACATA CAAAAGTAGG 
TAAAAATAAA GTTAGAAATT TGGATGGAAA 
ATTTTTATTT CAATCTTTAA ACCTACCTTT 
GCGAGTTAGT GCCTGGGAGA AGACAAGAAG 
CGCTCAATCA CGGACCCTCT TCTGTTCTTC 
ATGTTCTGTT CCTCGTGTGG GGTTGTGCAG 
TACAAGACAA GGAGCACACC CCAACACGTC 
TGTATACACA TTATGCTTCA AAATAACTTC 
ACATATGTGT AATACGAAGT TTTATTGAAG 
CCAGTTAATA GATAGAAGAG GAATAAGTAA 
GGTCAATTAT CTATCTTCTC CTTATTCATT 
GGTAAGTGGG GGGGCCTGGG ATCAAATAGC 
CCATTCACCC CCCCGGACCC TAGTTTATCG 
CTTGAGCCCT GAATGAGTCT GCCTTCCAGG 
GAACTCGGGA CTTACTCAGA CGGAAGGTCC 
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32 01 GCTCAAGGTG CTCAACAAAA CAACAGGCCT GCTATTTTCC TGGCATCTGT 
CGAGTTCCAC GAGTTGTTTT GTTGTCCGGA CGATAAAAGG ACCGTAGACA 

Nhel 



3251 GCCCTGTTTG GCTAGCTAGG AGCACACATA CATAGAAATT AAATGAAACA 

CGGGACAAAC CGATCGATCC TCGTGTGTAT GTATCTTTAA TTTACTTTGT 
33 01 GACCTTCAGC AAGGGGACAG AGGACAGAAT TAACCTTGCC CAGACACTGG 

CTGGAAGTCG TTCCCCTGTC TCCTGTCTTA ATTGGAACGG GTCTGTGACC 
3351 AAACCCATGT ATGAACACTC ACATGTTTGG GAAGGGGGAA GGGCACATGT 

TTTGGGTACA TACTTGTGAG TGTACAAACC CTTCCCCCTT CCCGTGTACA 
3401 AAATGAGGAC TCTTCCTCAT TCTATGGGGC ACTCTGGCCC TGCCCCTCTC 

TTTACTCCTG AGAAGGAGTA AGATACCCCG TGAGACCGGG ACGGGGAGAG 
3451 AGCTACTCAT CCATCCAACA CACCTTTCTA AGTACCTCTC TCTGCCTACA 

TCGATGAGTA GGTAGGTTGT GTGGAAAGAT TCATGGAGAG AGACGGATGT 
3501 CTCTGAAGGG GTTCAGGAGT AACTAACACA GCATCCCTTC CCTCAAATGA 

GAGACTTCCC CAAGTCCTCA TTGATTGTGT CGTAGGGAAG GGAGTTTACT 
3551 CTGACAATCC CTTTGTCCTG CTTTGTTTTT CTTTCCAGTC AGTACTGGGA 

GACTGTTAGG GAAACAGGAC GAAACAAAAA GAAAGGTCAG TCATGACCCT 
3601 AAGTGGGGAA GGACAGTCAT GGAGAAACTA CATAAGGAAG CACCTTGCCC 

TTCACCCCTT CCTGTCAGTA CCTCTTTGAT GTATTCCTTC GTGGAACGGG 
3651 TTCTGCCTCT TGAGAATGTT GATGAGTATC AAATCTTTCA AACTTTGGAG 

AAGACGGAGA ACTCTTACAA CTACTCATAG TTTAGAAAGT TTGAAACCTC 
37 01 GTTTGAGTAG GGGTGAGACT CAGTAATGTC CCTTCCAATG ACATGAACTT 

CAAACTCATC CCCACTCTGA GTCATTACAG GGAAGGTTAC TGTACTTGAA 
3751 GCTCACTCAT CCCTGGGGGC CAAATTGAAC AATCAAAGGC AGGCATAATC 

CGAGTGAGTA GGGACCCCCG GTTTAACTTG TTAGTTTCCG TCCGTATTAG 

SacII 

EcoRI NotI Nhel BamHI 



3 8 01 CAGTTATGAA TTCTTGCGGC CGCTTGCTAG CTTCACGTGT TGGATC CAAC 
GTCAATACTT AAGAACGCCG GCGAACGATC GAAGTGCACA ACCTAGGTTG 
SacII Apal 

*—•* *W -~> "*s* 

3 851 CGCGGAAGGG CCCTATTCTA TAGTGTCACC TAAATGCTAG AGCTCGCTGA 
GCGCCTTCCC GGGATAAGAT ATCACAGTGG ATTTACGATC TCGAGCGACT 

3 901 TCAGCCTCGA CTGTGCCTTC TAGTTGCCAG CCATCTGTTG TTTGCCCCTC 
AGTCGGAGCT GACACGGAAG ATCAACGGTC GGTAGACAAC AAACGGGGAG 

3 951 CCCCGTGCCT TCCTTGACCC TGGAAGGTGC CACTCCCACT GTCCTTTCCT 

GGGGCACGGA AGGAACTGGG ACCTTCCACG GTGAGGGTGA CAGGAAAGGA 
4001 AATAAAATGA GGAAATTGCA TCGCATTGTC TGAGTAGGTG TCATTCTATT 
TTATTTTACT CCTTTAACGT AGCGTAACAG ACTCATCCAC AGTAAGATAA 

4 051 CTGGGGGGTG GGGTGGGGCA GGACAGCAAG GGGGAGGATT GGGAAGACAA 

GACCCCCCAC CCCACCCCGT CCTGTCGTTC CCCCTCCTAA CCCTTCTGTT 
4101 TAGCAGGCAT GCTGGGGATG CGGTGGGCTC TATGGCTTCT GAGGCGGAAA 

ATCGTCCGTA CGACCCCTAC GCCACCCGAG ATACCGAAGA CTCCGCCTTT 
4151 GAACCAGCTG GGGCTCTAGG GGGTATCCCC ACGCGCCCTG TAGCGGCGCA 

CTTGGTCGAC CCCGAGATCC CCCATAGGGG TGCGCGGGAC ATCGCCGCGT 

42 01 TTAAGCGCGG CGGGTGTGGT GGTTACGCGC AGCGTGACCG CTACACTTGC 

AATTCGCGCC GCCCACACCA CCAATGCGCG TCGCACTGGC GATGTGAACG 
4251 CAGCGCCCTA GCGCCCGCTC CTTTCGCTTT CTTCCCTTCC TTTCTCGCCA 
GTCGCGGGAT CGCGGGCGAG GAAAGCGAAA GAAGGGAAGG AAAGAGCGGT 

43 01 CGTTCGCCGG GCCTCTCAAA AAAGGGAAAA AAAGCATGCA TCTCAATTAG 

GCAAGCGGCC CGGAGAGTTT TTTCCCTTTT TTTCGTACGT AGAGTTAATC 
43 51 TCAGCAACCA TAGTCCCGCC CCTAACTCCG CCCATCCCGC CCCTAACTCC 
AGTCGTTGGT ATCAGGGCGG GGATTGAGGC GGGTAGGGCG GGGATTGAGG 
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44 01 GCCCAGTTCC GCCCATTCTC CGCCCCATGG CTGACTAATT TTTTTTATTT 

CGGGTCAAGG CGGGTAAGAG GCGGGGTACC GACTGATTAA AAAAAATAAA 
4451 ATGCAGAGGC CGAGGCCGCC TCGGCCTCTG AGCTATTCCA GAAGTAGTGA 

TACGTCTCCG GCTCCGGCGG AGCCGGAGAC TCGATAAGGT CTTCATCACT 
4501 GGAGGCTTTT TTGGAGGCCT AGGCTTTTGC AAAAAGCTTG GACAGCTCAG 

CCTCCGAAAA AACCTCCGGA TCCGAAAACG TTTTTCGAAC CTGTCGAGTC 
4551 GGCTGCGATT TCGCGCCAAA CTTGACGGCA ATCCTAGCGT GAAGGCTGGT 

CCGACGCTAA AGCGCGGTTT GAACTGCCGT TAGGATCGCA CTTCCGACCA 
4601 AGGATTTTAT CCCCGCTGCC ATCATGGTTC GACCATTGAA CTGCATCGTC 

TCC TAAAATA GGGGCGACGG TAGTACCAAG CTGGTAACTT GACGTAGCAG 
4 651 GCCGTGTCCC AAAATATGGG GATTGGCAAG AACGGAGACC TACCCTGGCC 

CGGCACAGGG TTTTATACCC CTAACCGTTC TTGCCTCTGG ATGGGACCGG 
47 01 TCCGCTCAGG AACGAGTTCA AGTACTTCCA AAGAATGACC ACAACCTCTT 

AGGCGAGTCC TTGCTCAAGT TCATGAAGGT TTCTTACTGG TGTTGGAGAA 
4751 CAGTGGAAGG TAAACAGAAT CTGGTGATTA TGGGTAGGAA AACCTGGTTC 

GTCACCTTCC ATTTGTCTTA GACCACTAAT ACCCATCCTT TTGGACCAAG 
4 801 TCCATTCCTG AGAAGAATCG ACCTTTAAAG GACAGAATTA ATATAGTTCT 

AGGTAAGGAC TCTTCTTAGC TGGAAATTTC CTGTCTTAAT TATATCAAGA 
4851 CAGTAGAGAA CTCAAAGAAC CACCACGAGG AGCTCATTTT CTTGCCAAAA 

GTGATCTCTT GAGTTTCTTG GTGGTGCTCC T CGAGTAAAA GAACGGTTTT 
4 901 GTTTGGATGA TGCCTTAAGA CTTATTGAAC AACCGGAATT GGCAAGTAAA 

CAAACCTACT ACGGAATTCT GAATAACTTG TTGGCCTTAA CCGTTCATTT 

4 951 GTAGACATGG TTTGGATAGT CGGAGGCAGT TCTGTTTACC AGGAAGC C AT 

CATCTGTACC AAACCTATCA GCCTCCGTCA AGACAAATGG TCCTTCGGTA 
5001 GAATCAAC C A GGCCACCTTA GACTCTTTGT GACAAGGATC ATGCAGGAAT 

CTTAGTTGGT CCGGTGGAAT CTGAGAAACA CTGTTCCTAG TACGTCCTTA 
5051 TTGAAAGTGA CACGTTTTTC CCAGAAATTG ATTTGGGGAA ATATAAACTT 

AACTTTCACT GTGCAAAAAG GGTCTTTAAC TAAACCCCTT TATATTTGAA 
5101 CTCCCAGAAT ACCCAGGCGT CCTCTCTGAG GTCCAGGAGG AAAAAGGCAT 

GAGGGTCTTA TGGGTCCGCA GGAGAGACTC CAGGTCCTCC TTTTTCCGTA 
5151 CAAGTATAAG TTTGAAGTCT ACGAGAAGAA AGACTAACAG GAAGATGCTT 

GTTCATATTC AAACTTCAGA TGCTCTTCTT TCTGATTGTC CTTCTACGAA 

52 01 TCAAGTTCTC TGCTCCCCTC CTAAAGCTAT GCATTTTTAT AAGAC CATGG 

AGTTCAAGAG ACGAGGGGAG GATTTCGATA CGTAAAAATA TTCTGGTACC 
5251 GACTTTTGCT GGCTTTAGAT CTCTTTGTGA AGGAACCTTA CTTCTGTGGT 
CTGAAAACGA CCGAAATCTA GAGAAACACT TCCTTGGAAT GAAGACACCA 

53 01 GTGACATAAT TGGACAAACT ACCTACAGAG ATTTAAAGCT CTAAGGTAAA 

CACTGTATTA ACCTGTTTGA TGGATGTCTC TAAATTTCGA GATTCCATTT 

53 51 TATAAAATTT TTAAGTGTAT AATGTGTTAA ACTACTGATT CTAATTGTTT 

ATATTTTAAA AATTCACATA TTACACAATT TGATGACTAA GATTAACAAA 
5401 GTGTATTTTA GATTC CAAC C TATGGAACTG ATGAATGGGA GCAGTGGTGG 
CACATAAAAT CTAAGGTTGG ATACCTTGAC TACTTACCCT CGTCACCACC 

54 51 AATGCCTTTA ATGAGGAAAA CCTGTTTTGC T C AG AAG AAA TGCCATCTAG 

TTACGGAAAT TACTCCTTTT GGACAAAACG AGTCTTCTTT ACGGTAGATC 
5501 TGATGATGAG GCTACTGCTG ACTCTCAACA TTCTACTCCT CCAAAAAAGA 

ACTACTACTC CGATGACGAC TGAGAGTTGT AAGATGAGGA GGTTTTTTCT 
5551 AGAGAAAGGT AGAAGACCCC AAGGACTTTC CTTCAGAATT GCTAAGTTTT 

TCTCTTTCCA TCTTCTGGGG TTCCTGAAAG GAAGTCTTAA CGATTCAAAA 
56 01 TTGAGTCATG CTGTGTTTAG TAATAGAACT CTTGCTTGCT TTGCTATTTA 

AACTCAGTAC GACACAAATC ATTATCTTGA GAACGAACGA AACGATAAAT 
56 51 CACCACAAAG GAAAAAGCTG CACTGCTATA CAAGAAAATT ATGGAAAAAT 

GTGGTGTTTC CTTTTTCGAC GTGACGATAT GTTCTTTTAA TACCTTTTTA 

5 701 ATTCTGTAAC CTTTATAAGT AGGCATAACA GTTATAATCA TAACATACTG 

TAAGACATTG GAAATATTCA TCCGTATTGT CAATATTAGT ATTGTATGAC 
5751 TTTTTTCTTA CTCCACACAG GCATAGAGTG TCTGCTATTA ATAACTATGC 
AAAAAAGAAT GAGGTGTGTC CGTATCTCAC AGACGATAAT TATTGATACG 
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5801 TCAAAAATTG TGTACCTTTA GCTTTTTAAT TTGTAAAGGG GTTAATAAGG 

AGTTTTTAAC ACATGGAAAT CGAAAAATTA AACATTTCCC CAATTATTCC 
5851 AATATTTGAT GTATAGTGCC TTGACTAGAG ATCATAATCA GCCATACCAC 

TTATAAACTA CATATCACGG AACTGATCTC TAGTATTAGT CGGTATGGTG 
5901 ATTTGTAGAG GTTTTACTTG CTTTAAAAAA CCTCCCACAC CTCCCCCTGA 

TAAACATCTC CAAAATGAAC GAAATTTTTT GGAGGGTGTG GAGGGGGACT 
5951 ACCTGAAACA TAAAATGAAT GCAATTGTTG TTGTTAACTT GTTTATTGCA 

TGGACTTTGT ATTTTAC TTA CGTTAACAAC AACAATTGAA CAAATAACGT 
6001 GCTTATAATG GTTACAAATA AAGCAATAGC ATCACAAATT TCACAAATAA 

CGAATATTAC CAATGTTTAT TTCGTTATCG TAGTGTTTAA AGTGTTTATT 
6051 AGCATTTTTT TCACTGCATT CTAGTTGTGG TTTGTCCAAA CTCATCAATG 

TCGTAAAAAA AGTGACGTAA GATCAACACC AAACAGGTTT GAGTAGTTAC 
6101 TATCTTATCA TGTCTGGATC GGCTGGATGA TCCTCCAGCG CGGGGATCTC 

ATAGAATAGT ACAGACCTAG CCGACCTACT AGGAGGTCGC GCCCCTAGAG 
6151 ATGCTGGAGT TCTTCGCCCA CCCCAACTTG TTTATTGCAG CTTATAATGG 

TACGACCTCA AGAAGCGGGT GGGGTTGAAC AAATAACGTC GAATATTACC 
62 01 TTACAAATAA AGCAATAGCA TCACAAATTT CACAAATAAA GCATTTTTTT 

AATGTTTATT TCGTTATCGT AGTGTTTAAA GTGTTTATTT CGTAAAAAAA 

62 51 CACTGCATTC TAGTTGTGGT TTGTCCAAAC TCATCAATGT ATCTTATCAT 

GTGACGTAAG ATCAACACCA AACAGGTTTG AGTAGTTACA TAGAATAGTA 

63 01 GTCTGTATAC CGTCGACCTC TAGCTAGAGC TTGGCGTAAT CATGGTCATA 

CAGACATATG GCAGCTGGAG ATCGATCTCG AACCGCATTA GTACCAGTAT 

63 51 GCTGTTTCCT GTGTGAAATT GTTATCCGCT CACAATTCCA CACAACATAC 

CGACAAAGGA CACACTTTAA CAATAGGCGA GTGTTAAGGT GTGTTGTATG 

64 01 GAGCCGGAAG CATAAAGTGT AAAGCCTGGG GTGCCTAATG AGTGAGCTAA 

CTCGGCCTTC GTATTTCACA TTTCGGACCC CACGGATTAC TCACTCGATT 
6451 CTCACATTAA TTGCGTTGCG CTCACTGCCC GCTTTCCAGT CGGGAAACCT 

GAGTGTAATT AACGCAACGC GAGTGACGGG CGAAAGGTCA GCCCTTTGGA 
6501 GTCGTGCCAG CTGCATTAAT GAATCGGCCA ACGCGCGGGG AGAGGCGGTT 

CAGCACGGTC GACGTAATTA CTTAGCCGGT TGCGCGCCCC TCTCCGCCAA 
6551 TGCGTATTGG GCGCTCTTCC GCTTCCTCGC TCACTGACTC GCTGCGCTCG 

ACGCATAACC CGCGAGAAGG CGAAGGAGCG AGTGAC TGAG CGACGCGAGC 
6601 GTCGTTCGGC TGCGGCGAGC GGTAT CAGCT CACTCAAAGG CGGTAATACG 

CAGCAAGCCG ACGCCGCTCG C CATAGTCGA GTGAGTTTCC GCCATTATGC 
6651 GTTATCCACA GAATCAGGGG ATAACGCAGG AAAGAACATG TGAGCAAAAG 

CAATAGGTGT CTTAGTCCCC TATTGCGTCC TTTCTTGTAC ACTCGTTTTC 
6701 GCCAGCAAAA GGCCAGGAAC CGTAAAAAGG CCGCGTTGCT GGCGTTTTTC 

CGGTCGTTTT CCGGTCCTTG GCATTTTTCC GGCGCAACGA CCGCAAAAAG 
6751 CATAGGCTCC GCCCCCCTGA CGAGCATCAC AAAAATCGAC GCTCAAGTCA 

GTATCCGAGG CGGGGGGACT GCTCGTAGTG TTTTTAGCTG CGAGTTCAGT 
6801 GAGGTGGCGA AACCCGACAG GAC TATAAAG ATACCAGGCG TTTCCCCCTG 

CTCCACCGCT TTGGGCTGTC CTGATATTTC TATGGTCCGC AAAGGGGGAC 
6851 GAAGCTCCCT CGTGCGCTCT CCTGTTCCGA CCCTGCCGCT TACCGGATAC 

CTTCGAGGGA GCACGCGAGA GGACAAGGCT GGGACGGCGA ATGGCCTATG 
6 901 CTGTCCGCCT TTCTCCCTTC GGGAAGCGTG GCGCTTTCTC AATGCTCACG 

GACAGGCGGA AAGAGGGAAG CCCTTCGCAC CGCGAAAGAG TTACGAGTGC 
6951 CTGTAGGTAT CTCAGTTCGG TGTAGGTCGT TCGCTCCAAG CTGGGCTGTG 

GAGATCCATA GAGTCAAGCC ACATCCAGCA AGCGAGGTTC GACCCGACAC 
7001 TGCACGAACC CCCCGTTCAG CCCGACCGCT GCGCCTTATC CGGTAACTAT 

ACGTGCTTGG GGGGCAAGTC GGGCTGGCGA CGCGGAATAG GCCATTGATA 
7051 CGTCTTGAGT CCAACCCGGT AAGACACGAC TTATCGCCAC TGGCAGCAGC 

GCAGAACTCA GGTTGGGCCA TTCTGTGCTG AATAGCGGTG ACCGTCGTCG 
7101 CACTGGTAAC AGGATTAGCA GAGCGAGGTA TGTAGGCGGT GCTACAGAGT 

GTGACCATTG TCCTAATCGT CTCGCTCCAT ACATCCGCCA CGATGTCTCA 
7151 TCTTGAAGTG GTGGCCTAAC TACGGCTACA CTAGAAGGAC AGTATTTGGT 

AGAACTTCAC CACCGGATTG ATGCCGATGT GATCTTCCTG TCATAAACCA 
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72 01 ATCTGCGCTC TGCTGAAGCC AGTTACCTTC GGAAAAAGAG TTGGTAGCTC 

TAGACGCGAG ACGACTTCGG TCAATGGAAG CCTTTTTCTC AACCATCGAG 
72 51 TTGATCCGGC AAACAAACCA CCGCTGGTAG CGGTGGTTTT TTTGTTTGCA 

AACTAGGCCG TTTGTTTGGT GGCGACCATC GCCACCAAAA AAACAAACGT 
7 3 01 AGCAGCAGAT TACGCGCAGA AAAAAAGGAT CTCAAGAAGA TCCTTTGATC 

TCGTCGTCTA ATGCGCGTCT TTTTTTCCTA GAGTTCTTCT AGGAAACTAG 
7351 TTTTCTACGG GGTCTGACGC TCAGTGGAAC GAAAACTCAC GTTAAGGGAT 

AAAAGATGCC CCAGACTGCG AGTCACCTTG CTTTTGAGTG CAATTCCCTA 
7401 TTTGGTCATG AGATTATCAA AAAGGATCTT CACCTAGATC CTTTTAAATT 

AAACCAGTAC TCTAATAGTT TTTCCTAGAA GTGGATCTAG GAAAATTTAA 
7451 AAAAATGAAG TTTTAAATCA ATCTAAAGTA TATATGAGTA AACTTGGTCT 

TTTTTACTTC AAAATTTAGT TAGATTTCAT ATATACTCAT TTGAACCAGA 
7501 GACAGTTACC AATGCTTAAT CAGTGAGGCA CCTATCTCAG CGATCTGTCT 

CTGTCAATGG TTACGAATTA GTCACTCCGT GGATAGAGTC GCTAGACAGA 
7551 ATTTCGTTCA TCCATAGTTG CCTGACTCCC CGTCGTGTAG ATAACTACGA 

TAAAGCAAGT AGGTATCAAC GGACTGAGGG GCAGCACATC TATTGATGCT 
7601 TACGGGAGGG CTTACCATCT GGCCCCAGTG CTGCAATGAT ACCGCGAGAC 

ATGCCCTCCC GAATGGTAGA CCGGGGTCAC GACGTTACTA TGGCGCTCTG 
7651 CCACGCTCAC CGGCTCCAGA TTTATCAGCA ATAAACCAGC CAGCCGGAAG 

GGTGCGAGTG GCCGAGGTCT AAATAGTCGT TATTTGGTCG GTCGGCCTTC 
7 701 GGCCGAGCGC AGAAGTGGTC CTGCAACTTT ATCCGCCTCC ATCCAGTCTA 

CCGGCTCGCG TCTTCACCAG GACGTTGAAA TAGGCGGAGG TAGGTCAGAT 
7 751 TTAATTGTTG CCGGGAAGCT AGAGTAAGTA GTTCGCCAGT TAATAGTTTG 

AATTAACAAC GGCCCTTCGA TCTCATTCAT CAAGCGGTCA ATTATCAAAC 
7 801 CGCAACGTTG TTGCCATTGC TACAGGCATC GTGGTGTCAC GCTCGTCGTT 

GCGTTGCAAC AACGGTAACG ATGTC CGTAG CACCACAGTG CGAGCAGCAA 
7851 TGGTATGGCT TCATTCAGCT CCGGTTCCCA ACGATCAAGG CGAGTTACAT 

ACCATACCGA AGTAAGTCGA GGCCAAGGGT TGCTAGTTCC GCTCAATGTA 
7 901 GATCCCCCAT GTTGTGCAAA AAAGCGGTTA GCTCCTTCGG TCCTCCGATC 

CTAGGGGGTA CAACACGTTT TTTCGCCAAT CGAGGAAGCC AGGAGGCTAG 
7951 GTTGTCAGAA GTAAGTTGGC CGCAGTGTTA TCACTCATGG TTATGGCAGC 

CAACAGTCTT CATTCAACCG GCGTCACAAT AGTGAGTACC AATACCGTCG 
8001 ACTGCATAAT TCTCTTACTG TCATGCCATC CGTAAGATGC TTTTCTGTGA 

TGACGTATTA AGAGAATGAC AGTACGGTAG GCATTCTACG AAAAGACACT 
8051 CTGGTGAGTA CTCAACCAAG TCATTCTGAG AATAGTGTAT GCGGCGACCG 

GACCACTCAT GAGTTGGTTC AGTAAGACTC TTATCACATA CGCCGCTGGC 
8101 AGTTGCTCTT GCCCGGCGTC AATACGGGAT AATACCGCGC CACATAGCAG 

TCAACGAGAA CGGGCCGCAG TTATGCCCTA TTATGGCGCG GTGTATCGTC 
8151 AACTTTAAAA GTGCTCATCA TTGGAAAACG TTCTTCGGGG CGAAAACTCT 

TTGAAATTTT CACGAGTAGT AACCTTTTGC AAGAAGCCCC GCTTTTGAGA 
82 01 CAAGGAT CTT ACCGCTGTTG AGATCCAGTT CGATGTAACC CACTCGTGCA 

GTTCCTAGAA TGGCGACAAC TCTAGGTCAA GCTACATTGG GTGAGCACGT 
82 51 CCCAACTGAT CTTCAGCATC TTTTACTTTC ACCAGCGTTT CTGGGTGAGC 

GGGTTGACTA GAAGTCGTAG AAAATGAAAG TGGTCGCAAA GACCCACTCG 
8301 AAAAACAGGA AGGCAAAATG CCGCAAAAAA GGGAATAAGG GCGACACGGA 

TTTTTGTCCT TCCGTTTTAC GGCGTTTTTT CCCTTATTCC CGCTGTGCCT 
8351 AATGTTGAAT ACTCATACTC TTCCTTTTTC AATATTATTG AAGCATTTAT 

TTACAACTTA TGAGTATGAG AAGGAAAAAG TTATAATAAC TTCGTAAATA 
8401 CAGGGTTATT GTCTCATGAG CGGATACATA TTTGAATGTA TTTAGAAAAA 

GTCCCAATAA CAGAGTACTC GCCTATGTAT AAACTTACAT AAATCTTTTT 
8451 TAAACAAATA GGGGTTCCGC GCACATTTCC CCGAAAAGTG CCACCTGACG 

ATTTGTTTAT CCCCAAGGCG CGTGTAAAGG GGCTTTTCAC GGTGGACTGC 

Nhel AscI 



8501 TCGACGGATC GGGAGATCTG CTAGCCCGGG TGACCTGAGG CGCGCCGGCT 
AGCTGCCTAG CCCTCTAGAC GATCGGGCCC ACTGGACTCC GCGCGGCCGA 
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8551 TCGAATAGCC AGAGTAACCT TTTTTTTTAA TTTTATTTTA TTTTATTTTT 
AGCTTATCGG TCTCATTGGA AAAAAAAATT AAAATAAAAT AAAATAAAAA 

86 01 GAGATGGAGT TTGGCGCCGA TCTCCCGATC CCCTATGGTC GACTCTCAGT 

CTCTACCTCA AACCGCGGCT AGAGGGCTAG GGGATACCAG CTGAGAGTCA 
8651 ACAATCTGCT CTGATGCCGC ATAGTTAAGC CAGTATCTGC TCCCTGCTTG 
TGTTAGACGA GACTACGGCG TATCAATTCG GTCATAGACG AGGGACGAAC 

87 01 TGTGTTGGAG GTCGCTGAGT AGTGCGCGAG CAAAATTTAA GCTACAACAA 

ACACAACCTC CAGCGACTCA TCACGCGCTC GTTTTAAATT CGATGTTGTT 
8 751 GGCAAGGCTT GACCGACAAT TGCATGAAGA ATCTGCTTAG GGTTAGGCGT 

CCGTTCCGAA CTGGCTGTTA ACGTACTTCT TAGACGAATC CCAATCCGCA 
8 801 TTTGCGCTGC TTCGCGATGT ACGGGCCAGA TATACGCGTT GACATTGATT 

AAACGCGACG AAGCGCTACA TGCCCGGTCT ATATGCGCAA CTGTAACTAA 
8 851 ATTGACTAGT TATTAATAGT AATC 

TAACTGATCA ATAATTATCA TTAG 
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